Sparsification Enables Predicting Kissing Hairpin Pseudoknot Structures of Long RNAs in Practice
نویسندگان
چکیده
While computational RNA secondary structure prediction is an important tool in RNA research, it is still fundamentally limited to pseudoknot-free structures (or at best very simple pseudoknots) in practice. Here, we make the prediction of complex pseudoknots – including kissing hairpin structures – practically applicable by reducing the originally high space consumption. For this aim, we apply the technique of sparsification and other space-saving modifications to the recurrences of the pseudoknot prediction algorithm by Chen, Condon and Jabbari (CCJ algorithm). Thus, the theoretical space complexity of free energy minimization is reduced to Θ(n3 + Z), in the sequence length n and the number of non-optimally decomposable fragments (“candidates”) Z. The sparsified CCJ algorithm, sparseCCJ, is presented in detail. Moreover, we provide and compare three generations of CCJ implementations, which continuously improve the space requirements: the original CCJ implementation, our first modified implementation, and our final sparsified implementation. The two latest implementations implement the established HotKnots DP09 energy model. In our experiments, using 244GB of RAM, the original CCJ implementation failed to handle sequences longer than 195 bases; sparseCCJ handles our pseudoknot data set (up to about length 400 bases) in this space limit. All three CCJ implementations are available at https://github.com/HosnaJabbari/CCJ. 1998 ACM Subject Classification J.3 Life and Medical Sciences
منابع مشابه
Heuristic RNA pseudoknot prediction including intramolecular kissing hairpins.
Pseudoknots are an essential feature of RNA tertiary structures. Simple H-type pseudoknots have been studied extensively in terms of biological functions, computational prediction, and energy models. Intramolecular kissing hairpins are a more complex and biologically important type of pseudoknot in which two hairpin loops form base pairs. They are hard to predict using free energy minimization ...
متن کاملPredicting RNA pseudoknot folding thermodynamics
Based on the experimentally determined atomic coordinates for RNA helices and the self-avoiding walks of the P (phosphate) and C4 (carbon) atoms in the diamond lattice for the polynucleotide loop conformations, we derive a set of conformational entropy parameters for RNA pseudoknots. Based on the entropy parameters, we develop a folding thermodynamics model that enables us to compute the sequen...
متن کاملKissing of the two predominant hairpin loops in the coxsackie B virus 3' untranslated region is the essential structural feature of the origin of replication required for negative-strand RNA synthesis.
Higher-order RNA structures in the 3' untranslated region (3'UTR) of enteroviruses are thought to play a pivotal role in viral negative-strand RNA synthesis. The structure of the 3'UTR was predicted by thermodynamic calculations using the STAR (structural analysis of RNA) computer program and experimentally verified using chemical and enzymatic probing of in vitro-synthesized RNA. A possible ps...
متن کاملCCG-Based RNA Secondary Structure Prediction
Various systems have been proposed to predict secondary structures of RNAs using their sequence information. Among them, Uemura et al. [2] described a system that recognizes some typical RNA secondary structures such as hairpin loops and pseudoknots with Tree Adjoining Grammar. However, their work captures only known sub-structures, and not those unknown sub-structures that might also exist. Te...
متن کاملIntramolecular secondary structure rearrangement by the kissing interaction of the Neurospora VS ribozyme.
Kissing interactions in RNA are formed when bases between two hairpin loops pair. Intra- and intermolecular kissing interactions are important in forming the tertiary or quaternary structure of many RNAs. Self-cleavage of the wild-type Varkud satellite (VS) ribozyme requires a kissing interaction between the hairpin loops of stem-loops I and V. In addition, self-cleavage requires a rearrangemen...
متن کامل